SemanticScuttle - klotz.me » Tags: simon willison

Tags: simon willison*

0 bookmark(s) - Sort by: Date ↓ / Title /

Feed a video to a vision LLM as a sequence of JPEG frames on the CLI (also LLM 0.25)

This article details a new plugin, llm-video-frames, that allows users to feed video files into long context vision LLMs (like GPT-4.1) by converting them into a sequence of JPEG frames. It showcases how to install and use the plugin, provides examples with the Cleo video, and discusses the cost and technical details of the process. It also covers the development of the plugin using an LLM and highlights other features in LLM 0.25.

2025-05-06 Tags: ffmpeg, llm, vision, video, jpeg, simon willison by klotz

Understanding the recent criticism of the Chatbot Arena

An analysis of the recent paper 'The Leaderboard Illusion' which critiques the Chatbot Arena's LLM evaluation methodology, focusing on issues with private testing, unfair sampling, and potential gaming of the leaderboard. It also explores OpenRouter as a potential alternative ranking system.

2025-05-01 Tags: llm, benchmarks, openrouter, chatbot arena, simon willison by klotz

Qwen 3 offers a case study in how to effectively release a model

Alibaba’s Qwen team released the Qwen 3 model family, offering a range of sizes and capabilities. The article discusses the model's features, performance, and the well-coordinated release across the LLM ecosystem, highlighting the trend of better models running on the same hardware.

2025-04-29 Tags: llm, qwen, mlx, ollama, reasoning, qwen 3, alibaba, simon willison by klotz

Start building with Gemini 2.5 Flash

Google's Gemini 2.5 Flash model is a new, faster, and more cost-effective model with adjustable 'thinking' capabilities. The article details how to use it with llm-gemini, explores pricing differences compared to Gemini 2.0 Flash, and shares example SVG outputs.

2025-04-18 Tags: gemini, 2.5 flash, llm, google, simon willison by klotz

Long context support in LLM 0.24 using fragments and template plugins

LLM 0.24 introduces fragments and template plugins to better utilize long context models, improving storage efficiency and enabling new features like querying logs by fragment and leveraging documentation. It also details improvements to template handling and model support.

2025-04-08 Tags: llm, context, simon willison by klotz

Qwen2.5-VL-32B: Smarter and Lighter

A review of the Qwen2.5-VL-32B large language model, noting its performance, capabilities, and how it runs on a 64GB Mac. Includes a demonstration with a map image and performance statistics.

2025-03-26 Tags: vision, llm, qwen, simon willison by klotz

Here’s how I use LLMs to help me write code

Simon Willison discusses his experience using Large Language Models (LLMs) for coding, providing detailed advice on how to effectively use LLMs to augment coding abilities, set reasonable expectations, manage context, and more.

2025-03-12 Tags: llm, coding, ai-assisted programming, simon willison by klotz

How To Use LLMs For Programming Tasks

A guide on using large language models (LLMs) for programming tasks, including examples, strategies, and useful tips for effectively using AI assistants like ChatGPT and Claude.

2025-03-12 Tags: hackaday, llm, simon willison, programming by klotz

Claude 3.7 Sonnet, extended thinking and long output, llm-anthropic 0.14

Simon Willison discusses the release of llm-anthropic 0.14, which adds support for Claude 3.7 Sonnet's new features. Key features include extended thinking mode, a massive increase in output limits, and improved support for long tasks. The article also covers the plugin's implementation details and limitations.

2025-02-25 Tags: claude, claude 3.7 sonnet, llm-anthropic, llm, simon willison by klotz

shot-scraper 1.6 with support for HTTP Archives

New release of shot-scraper CLI tool for taking screenshots and scraping web pages with support for HTTP Archive (HAR) files.

2025-02-14 Tags: scraper, http, har, playwright, simon willison by klotz

SemanticScuttle - klotz.me

Tags: simon willison*

Linked Tags

Related Tags